When do libraries of articulated images obey an isometric embedding?

نویسنده

  • David L. Donoho
چکیده

in image understanding and image coding, it could be useful to ‘learn’ the structure of such image articulation manifolds and to recover the underlying parameters (location, scale, etc.) from unlabeled data. This could be important for recognizing articulated vehicles in target recognition, and for understanding articulated faces in facial recognition. The general problem of learning the shape of a manifold from scattered observations has been around for a long time; it has been the source of many multivariate techniques, including principal components analysis, independent components analysis, multidimensional scaling, selforganizing mappings, and other important methodological developments. Recently, Tenenbaum et al. [9] proposed the Isomap procedure as a general tool for recovering the unknown parametrization underlying a set of digital images, {Ii}, of faces in various attitudes and articulations. The general principle of Isomap is to measure distance between images, not using Euclidean distance (which is ignorant of the manifold structures), but using distance according to the shortest path in the nearest neighbor graph; and to use this graph distance as input to a classical “principal coordinates” multidimensional scaling procedure. 1.1 Validating Isomap Tenenbaum et al. [9] published a few interesting examples, for example mapping out the parameters underlying a face seen from a variety of viewpoints. These empirical successes lead to the ... Obvious Question: how “correct” is the Isomap procedure; does it really recover the “true” underlying parametrization of families of articulated images? This leads immediately to the ... Obvious Approach: Test Isomap for synthetic data where we know a priori “the natural” parametrizations and see if it can recover the parametrization of image manifolds. In following the ‘obvious’ approach, we would construct datasets of artificial images undergoing standard articulations – translations, rotations, etc. – and ask if the Isomap parametrization correctly discovers the underlying parametrizations. While the question and the investigative approach seem clear at first glance, on closer inspection, it is not clear that one can push this line of investigation very far. There are several reasons a purely empirical investigation based on running Isomap on artificial examples may not be enlightening. • Sampling issues. Whether Isomap works or not might depend on how many model images Ii are in the database. In some vague sense, it will be important for these images to be well-distributed across the image manifold, but exactly what this means in a specific instance is unclear. Consequently, if Isomap ‘fails’, this may be due to poor data sampling rather than any intrinsic property of Isomap. • Digitization issues. In some sense, the fact that images are discretized into pixels makes them ‘noisy’ / ‘blocky’; this again causes some difficulty in interpreting ‘failure’ of Isomap – is it due to the pixelization or is it intrinsic to the type of articulation? • ‘Big Picture’ issues. Empirical work really doesn’t give us an intellectual framework that we can leverage into other settings, at least not the kind of framework that might be possible by a more theoretical approach. For these and other reasons, we propose to develop an alternate framework for understanding Isomap. We think of an image as a function I(x) of a continuous variable x ∈ R. We consider articulations of a base image I0, producing a family of images {Iθ : θ ∈ Θ}, where θ is the parameter of the articulation and Θ is the parameter space. Examples we will consider include translation families, where Θ = R, Iθ(x) = I0(x − θ). In this “continuum” viewpoint, neither sampling nor digitization can cause problems, and a clear intellectual framework exists naturally.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Steganalysis of embedding in difference of image pixel pairs by neural network

In this paper a steganalysis method is proposed for pixel value differencing method. This steganographic method, which has been immune against conventional attacks, performs the embedding in the difference of the values of pixel pairs. Therefore, the histogram of the differences of an embedded image is di_erent as compared with a cover image. A number of characteristics are identified in the di...

متن کامل

تجزیه‌ ی تُنُک تصاویر ابرطیفی با استفاده از یک کتابخانه‌ ی طیفی هرس شده

Spectral unmixing of hyperspectral images is one of the most important research fields  in remote sensing. Recently, the direct use of spectral libraries in spectral unmixing is on increase. In this way  which is called sparse unmixing, we do not need an endmember extraction algorithm and the number determination of endmembers priori. Since spectral libraries usually contain highly correlated s...

متن کامل

A Theory for Optical flow-based Transport on Image Manifolds

An image articulation manifold (IAM) is the collection of images formed when an object is articulated in front of a camera. IAMs arise in a variety of image processing and computer vision applications, where they provide a natural lowdimensional embedding of the collection of high-dimensional images. To date IAMs have been studied as embedded submanifolds of Euclidean spaces. Unfortunately, the...

متن کامل

An extended feature set for blind image steganalysis in contourlet domain

The aim of image steganalysis is to detect the presence of hidden messages in stego images. We propose a blind image steganalysis method in Contourlet domain and then show that the embedding process changes statistics of Contourlet coefficients. The suspicious image is transformed into Contourlet space, and then the statistics of Contourlet subbands coefficients are extracted as features. We us...

متن کامل

Intelligent scalable image watermarking robust against progressive DWT-based compression using genetic algorithms

Image watermarking refers to the process of embedding an authentication message, called watermark, into the host image to uniquely identify the ownership. In this paper a novel, intelligent, scalable, robust wavelet-based watermarking approach is proposed. The proposed approach employs a genetic algorithm to find nearly optimal positions to insert watermark. The embedding positions coded as chr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003